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Description 

BACKGROUND OF THE INVENTION 

1 . Field of Invention 

[0001] The invention relates generally to computer 
systems. More particularly, methods and apparatus for 
selecting executable instructions in a multi-platform 
computing environment are disclosed. 

2. Description of Relevant Art 

[0002] The continuing proliferation of software plat- 
forms and hardware architectures ensures that both 
computer users and computer program developers will 
encounter many different computing environments in 
the course of their careers. It should be noted that in the 
context of this discussion, the term environment refers 
to the complete range of elements in a computing sys- 
tem that interact with the ported software. These ele- 
ments typically include a processor and operating sys- 
tem as well as I/O devices, libraries, networks, or, in 
some cases, a larger human or physical system. Even 
though a few quasi-standard platforms (e.g. IBM-PC, 
UNIX) have become widely used there, as yet, is no uni- 
versal computing environment. In order to maintain and 
expand their viability, therefore, most software pro- 
grams will eventually face the need to be ported, such 
that an executable version of the software program 
based on the existing version is created in the new com- 
puting environment. Portability, or the ability of a soft- 
ware program to be ported to a given environment (i.e., 
the target) is, therefore, becoming universally recog- 
nized as a desirable attribute for most software pro- 
grams. It is clear, therefore, that portability between dif- 
ferent computing platforms enhances the value of a soft- 
ware program both by extending its useful lifecycle and 
by expanding the range of installations in which it can 
be readily used. As is well known in the art, a software 
program can include an application program, a system 
program, or a component of a program whereas a soft- 
ware system is a collection of software programs. 
[0003] A software program is portable if and to the de- 
gree that the cost of porting is less than the cost of re- 
writing the program in the new target environment. A 
software program would be perfectly portable if it could 
be ported at zero cost and, of course, this is never pos- 
sible in practice. In practice there are two basic porta- 
bility protocols, the first being binary portability (i.e., 
porting the executable form of the software program) 
and the second being source portability (i.e., porting the 
source language representation of the software pro- 
gram). Although binary portability protocols typically of- 
fer several advantages (related primarily to ease of port- 
ing) it can only be used to port software programs across 
strongly similar environments thereby severely limiting 
its usefulness. In contrast, since source portability pro- 



tocols assume availability of a source code, they typi- 
cally provide a greater ability to adapt a particular soft- 
ware program to a wider range of computing environ- 
ments. 

5 [0004] Unfortunately, most of the porting process is 
still done by ad hoc methods that result in inefficient 
techniques that add substantially to the costs of porting 
software from one platform to another. By way of exam- 
ple, a compiler translates a computer program from one 
10 language into another, catching any errors in syntax 
along the way. Most commonly, a compiler translates 
some high level language, such as C++ or COBOL, into 
machine language such that the computer can under- 
stand without any translation. In order to fully port a corn- 
's piler, therefore, several tasks must be accomplished in 
order for the ported compiler to be able to successfully, 
and in a highly reliable manner, perform its designed 
functions while operating in a totally different platform 
than the one it was originally conceived. 
20 [0005] The several tasks required to be accomplished 
in order to fully port a compiler include proper instruction 
selection since generally many different instruction 
types can match the same machine independent se- 
mantics. A simple example is the operation defined as 
25 adding a constant of "I" to a value where the value of 
M 1 " can be represented as either an 8 bit or a 32 bit pre- 
cision integer number. In another example, for an X86 
processor found in the Pentium® and Pentium II® line 
of microprocessors, the floating point unit (FPU) has 3 
30 precision modes in which it can perform various opera- 
tions, such as addition and subtraction. In the case 
where the semantics require rounding to, for example 
24 bit precision, and an FPU control word has set the 
FPU precision to be, for example 53 bits, it would be 
35 inefficient and incorrect for the X86 compiler to select 
instructions defined in the architecture description that 
produce 53 bits of precision without introducing addi- 
tional rounding. 

[0006] Other examples include multiple hardware 
40 platforms, such as the SPARC microprocessor config- 
ured as a V8 or a V9 processor. When configured as a 
V8 system, only V8 type instructions can be executed, 
however, when configured as the V9 system, either V8 
type or V9 type instructions can be executed. Therefore, 
45 it is essential that in those cases where the V8 system 
is operating that only V8 type instructions be selected 
since V9 instructions can not execute on the V8 system. 
[0007] It is also desirable to select not only those in- 
structions that will properly execute on a particular plat- 
50 form, but also select those instructions that improve the 
overall performance of the processor by reducing the 
"cost" of execution. By way of example, storing the re- 
sults of a particular operation, such as a subtraction, in 
a memory location is generally more computer resource 
55 intensive (i.e., more costly) than storing the same result 
in a data register. Therefore, it would be more cost ef- 
fective, where possible, to select the instruction whose 
cost is the least of all those instructions that could pos- 
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sibly be used. Using the example above, it would make 
sense from a cost effectiveness standpoint to select the 
instruction that stores its result in a register as opposed 
to those instructions that store their respective result in 
a location in memory. 

[0008] Therefore, what is desired is the capability of 
defining a selection protocol whereby not only are the 
proper instructions selected to execute in a multi-plat- 
form computing environment, but the cost of executing 
these selected instructions is minimized. 

SUMMARY OF THE INVENTION 

[0009] Broadly speaking, the invention relates to an 
improved method, apparatus and computer system for 
building a compiler having an instruction selector in a 
multi-platform environment. The invention can be imple- 
mented in numerous ways, including as a method, a 
computer system, and an apparatus. Several embodi- 
ments of the invention are discussed below. 
[0010] According to one aspect of the present inven- 
tion, an apparatus for compiling a platform specific com- 
piler having an embedded instruction selector is de- 
scribed. The apparatus includes a set of user defined 
platform dependent compiler architecture descriptors 
that describe corresponding architectural features of a 
particular hardware platform and instruction predicates 
used by the instruction selector. An architecture descrip- 
tor compiler converts the user defined platform depend- 
ent compiler architecture descriptors into the platform 
dependent compiler source code and instruction selec- 
tor source which is converted into platform dependent 
object code and instruction selector object code by a 
host compiler. The compiler is formed from platform in- 
dependent compiler object code and the platform de- 
pendent compiler object code in combination with the 
instruction selector object code. 
[0011] As a method for building a platform specific 
compiler, a set of user defined platform dependent com- 
piler architecture descriptors that describe correspond- 
ing architectural features of a particular hardware plat- 
form dependent compiler and instruction predicates are 
provided. The descriptors and instruction predicates are 
converted into platform dependent compiler source 
code and instruction selector source code by an archi- 
tecture descriptor compiler. The platform dependent 
compiler source code and instruction selector source 
code is compiled into platform dependent object code 
and instruction selector object code. The platform spe- 
cific compiler having the embedded instruction selector 
is formed from the platform dependent object code, the 
instruction selector object code, and platform independ- 
ent compiler object code. 

[0012] In another embodiment, a platform specific 
compiler is disclosed. The compiler includes platform 
dependent compiler object code having embedded in- 
struction selector object code and platform independent 
compiler object code which are suitable for execution 



on a particular hardware platform. An interface that is 
partially embedded in the platform independent code 
and partially embedded in the platform dependent ob- 
ject code mediates flow of information between the plat- 
5 form independent compiler code and the platform de- 
pendent compiler code during platform specific compiler 
run time. 

[001 3] These and other advantages of the present in- 
vention will become apparent upon reading the following 
10 detailed descriptions and studying the various figures of 
the drawings. 

BRIEF DESCRIPTION OF THE DRAWINGS 

is [001 4] The invention, together with further advantag- 
es thereof, may best be understood by reference to the 
following description taken in conjunction with the ac- 
companying drawings in which: 

20 Fig. 1 is a representative block diagram of a multi- 
platform compiler system in accordance with an em- 
bodiment of the invention; 

Fig. 2 illustrates a particular implementation of the 
multi-platform compiler shown in Fig 1; 
25 Fig. 3 shows another implementation of the compil- 
er shown in Fig. 2; 

Fig. 4 shows a flowchart detailing a compiler build- 
ing process in accordance with an embodiment of 
the invention; 

30 Fig. 5 illustrates a Java Virtual Machine (JVM) hav- 
ing a platform specific compiler in accordance with 
an embodiment of the invention; 
Fig. 6A illustrates an exemplary AD file organization 
in accordance with an embodiment of the invention; 
35 Fig. 6B illustrates a particular relationship between 
various data fields included in the AD file shown in 
Fig. 6A; 

Fig. 7 illustrates an exemplary interface coupling 
platform dependent source code and platform inde- 
40 pendent source code in accordance with an embod- 
iment of the invention; 

Fig. 8 is an exemplary representation of a run-time 
process by the compilation engine in accordance 
with an embodiment of the invention; 
15 Fig. 9 is a representation of a machine independent 
instruction; 

Fig. 10 is flowchart detailing the instruction selec- 
tion process in accordance with an embodiment of 
the invention; and 
so Fig. 11 illustrates a computer system employed to 
implement the invention. 

DETAILED DESCRIPTION OF THE EMBODIMENTS 

55 [0015] In the following description, frameworks and 
methods of selecting instructions in a multi-platform 
computing environment are described. The invention 
will initially be described in terms of a multi-platform 
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compiler residing in a Java virtual machine. In general, 
in order to build a platform specific compiler having an 
embedded instruction selection code, a set of user de- 
fined platform specific architecture descriptors in the 
form of an architecture description language (ADL) file 5 
and a set of user defined instruction predicates are pro- 
vided. It should be noted that the ADL can take many 
forms well known to those skilled in the art such as C++, 
Attribute Grammars, Custom Description Languages, 
etc., or some combination of these forms. In the de- 
scribed embodiment, the instruction predicates take the 
form of boolean expressions that in conjunction with in- 
struction selection code embedded in the platform spe- 
cific compiler, enable only those instructions that are to 
be executed. In one embodiment of the invention, the 
determination of the boolean expression's value may 
only be possible during instruction selection. 
[0016] The AD file includes the user defined instruc- 
tion predicates that form an input to a multi-platform 
compiler builder that includes an architecture design 
language compiler (ADLC). In one embodiment, the 
ADLC is used to generate particular target platform de- 
pendent compiler source code used to build the target 
specific compiler. The ADLC also generates target plat- 
form instruction selection source code and target plat- 
form instruction predicate source code. The platform 
specific compiler having an embedded instruction se- 
lector is built using the platform dependent compiler 
source code, the instruction selection source code, and 
the instruction predicate source code provided by the 
ADLC in combination with the platform independent 
compiler object code otherwise provided. 
[00171 ' n this way, the platform specific compiler se- 
lects not only those instructions deemed by the user to 
be suitable for execution on the target platform but those 
instructions which improve processor performance by 
reducing instruction execution costs. In this way, the re- 
liability and performance of the processor in a multi-plat- 
form environment are substantially improved. 
[0018] Fig. 1 is a representative block diagram of a 
multi-platform compiler system 100 in accordance with 
an embodiment of the invention. The multi-platform 
compiler system 100 is capable of building, or compiling, 
a platform specific compiler capable of selecting instruc- 
tions appropriate to the target platform. In addition, the 
compiler is also capable of selecting those instructions 
that improve overall processor performance by reducing 
instruction execution cost. 

[0019] In the described embodiment, the platform 
specific compiler is built by compiling platform inde- 
pendent source code representing those portions of the 
platform specific compiler that are independent of any 
particular platform and platform dependent source code 
representing those features of the compiler that are plat- 
form specific. In those cases where user defined (ex- 
plicit) instruction predicates are provided, or where it is 
determined that particular instructions require instruc- 
tion predicates regardless whether or not they are user 



supplied (implicit), the compiler is built using instruction 
predicate source code that provides the platform specif- 
ic compiler with the aforementioned selection capability. 
[0020] More particularly, in the described embodi- 
ment, the multi-platform compiler system 100 includes 
a compiler builder 1 02 arranged to build the platform de- 
pendent compiler using platform specific architecture 
descriptors and instruction predicates. In the described 
embodiment, the platform specific architecture descrip- 
tors take the form of architecture description language 
(ADL) used to represent those platform dependent por- 
tions of the target platform compiler while the instruction 
predicates take the form of boolean expressions. It 
should be noted that the ADL can take many forms well 
known to those skilled in the art such as C++, Java 
source code, Pascal, etc. Typically, it is the AD file that 
is coded by the user of the multi-platform compiler sys- 
tem 1 00, however, in some cases, the AD file is provided 
by original equipment manufacturers in situations re- 
ferred to as "turn key 8 systems. In these situations, the 
end user has selected which target platforms are 
deemed to be useful and the supplier has provided the 
necessary coding efforts. 

[0021] In many instances, the AD file is located in, for 
example, memory systems external to the compiler 
builder 102 such as an AD file 104 and an AD file 106. 
It should be noted that although any number of AD files 
can be provided, each specific to a particular platform, 
however, only one AD file at a time is processed by the 
compiler builder 1 02. In this way, when platform depend- 
ent source code specific to, for example, a platform type 
1 is provided, the customized compiler builder 102 is ca- 
pable of compiling source code into platform type 1 ob- 
ject code. For example, the AD file 1 04 can include ADL 
code having instruction predicates used by the compiler 
builder 1 02 to build an X86 compiler capable of prefer- 
entially selecting those instructions enabled by the user 
defined instruction predicates. As well known in the art, 
X86 object code are those instructions executable by an 
X86 microprocessor manufactured by the Intel Corpo- 
ration of Santa Clara, CA. 

[0022] Along the same lines, the AD file 106 can in- 
clude ADL code having instruction predicates used by 
the compiler builder 1 02 to build, for example, a SPARC 
compiler capable of preferentially selecting those in- 
structions enabled by the user defined instruction pred- 
icates. 

[0023] In both cases, the overall performance of the 
processor can be improved since a cost analysis is also 
performed substantially simultaneously with the selec- 
tion. In this way, the compiler builder 102 is capable of 
automatically providing as many of the platform specific 
compilers having appropriate instruction selection code 
as there are AD files and corresponding user defined 
instruction predicates as are available. 
[0024] Referring now to Fig. 2 illustrating a particular 
implementation of the multi-platform compiler builder 
1 02 shown in Fig 1 . In the embodiment shown, the com- 
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piler builder 102 includes an architecture description 
language compiler 202 (ADLC) coupled to an AD file 
204 having user defined instruction predicates file 201 . 
The ADLC 202 is arranged to compile the platform spe- 
cific ADL code included in the AD file 204 and the in- 5 
struction predicates included in the file 201 to platform 
dependent compiler source code, instruction selector 
source code, and instruction predicate source code, re- 
spectively. In addition, the ADLC 202 is capable of gen- 
erating instruction predicates that are not explicitly writ- 
ten in the AD file 204 in order to ensure a correct or ef- 
ficient instruction selection, heretofore referred to as im- 
plicit instruction predicates. 

[0025] The platform dependent compiler source code, 
the instruction selector source code, and the instruction 
predicate source code are, in turn, provided to a host 
compiler 203 coupled to the ADLC 202. In the described 
embodiment, the host compiler 203 is a C++ compiler 
well known to those skilled in the art. However, any com- 
piler suitably arranged to compile source code to target 
specific compiler object code can be used. 
[0026] During compile time, the host compiler 203 
coupled to the ADLC 202 compiles the platform depend- 
ent source code in combination with the platform de- 
pendent instruction predicates source code and instruc- 
tion selector source code to form the platform depend- 
ent compiler object code having embedded instruction 
selector object code. In the described embodiment, the 
platform dependent compiler object code is represented 
by a code block 206 that includes the embedded instruc- 
tion selector object code represented by a code block 
216 included in a compiler unit 208 coupled to the host 
compiler 203. The embedded instruction selector object 
code provides the customized compiler unit 208 with the 
capability of selecting those instructions deemed appro- 
priate by the user. In addition, by performing a cost anal- 
ysis, those instructions which reduce instruction execu- 
tion cost are preferentially selected thereby improving 
processor performance by reducing overall instruction 
execution cost. 

[0027] In some cases, the platform independent com- 
piler object code is derived from platform independent 
compiler source code compiled by the host compiler 
203. In other cases, such as the described embodiment, 
the platform independent compiler object code is al- 
ready provided. In the described embodiment, the plat- 
form independent object code is represented by a code 
block 210 that includes, in one embodiment, a compila- 
tion engine 212 coupled to a platform independent in- 
terface 214. In a particular implementation, during what 
is referred to as run-time, the interface 214 mediates the 
transfer of information between selected portions of the 
platform dependent object code and instruction selec- 
tion object code to the compilation engine 212. 
[0028] By way of example, when an X86 microproc- 
essor is required, the user provides, for this example, 
the AD file 204 having stored therein the appropriate 
ADL code specific to the X86 platform and the file 201 



having the appropriate instruction predicates. As direct- 
ed by the compiler builder 102, both the X86 specific 
ADL code and the instruction predicates are supplied to 
the ADLC 202. The ADLC 202 then compiles the ADL 
code to X86 specific compiler source code and the in- 
struction predicates to corresponding platform specific 
instruction predicates source code and instruction se- 
lector source code. It should be noted that the ADLC 
202 is a universal compiler capable of converting any 
properly constructed AD file into corresponding platform 
specific compiler source code and, if required, platform 
dependent instruction predicate source code. In this 
way, any requirements that the user code, or in any way 
modify the ADLC 202 or any portion thereof are sub- 
stantially eliminated. Therefore, the only coding re- 
quired of the user is that required to provide a properly 
constructed and verified AD file that can include, if nec- 
essary, instruction predicates. 

[0029] Once the ADLC 202 has compiled the X86 
specific ADL code to X86 compiler source code and the 
instruction predicates into instruction selector source 
code, they are compiled to object code using host com- 
piler 203 and represented by the blocks 206 and 216, 
respectively. 

[0030] In some cases, multiple platform dependent 
compiler source code files are made available to a com- 
piler unit 300 shown in Fig. 3. The compiler unit 300 is 
one implementation of the compiler unit 208 shown in . 
Fig. 2 and, as such, should only be consider exemplary 
in nature. In the described embodiment,. the various plat- 
form dependent AD files are stored in a file stack 301 . 
The file stack 301 can be local to the compiler unit 300 
or, in some cases, can be remotely located in, for exam- 
ple, data bases, remote servers, etc. This arrangement 
is particularly well suited for applications involving trans- 
ferring data over coupled computer networks, such as 
the Internet, local area networks (LANs), and the like. 
This use of multiple AD files is particularly advantageous 
since it provides the compiler unit 300 with the capability 
of operating, as needed, as any platform specific com- 
piler represented by the corresponding AD file. By way 
of example, the file stack 301 includes platform an AD 
file 302 representing, for example, SPARC specific de- 
scriptors along with their corresponding instruction 
predicates which is converted to corresponding platform 
dependent source code 304 and instruction selector 
306. With this arrangement, any platform having its cor- 
responding AD file included in the file stack 301 could 
be selected to customize the compiler unit 300. Al- 
though not shown, a selector is typically employed to 
select which AD file is to be input to the ADLC 202. 
[0031] As discussed above, the compiler builder 102 
builds a particular compiler having an instruction selec- 
tor, using in one embodiment, a process 400 detailed by 
the flowchart shown in Fig. 4. The process 400 begins 
at 402 by providing platform specific architecture de- 
scriptors in the form of ADL code stored in, for example, 
an AD file. At 404 a determination is made whether or 
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not the AD file includes user defined instruction predi- 
cates (explicit) or whether or not implicit instruction pred- 
icates are required. Implicit instruction predicates are re- 
quired when, for example, it is essential for proper in- 
struction execution that particular data be present in par- 
ticular locations. Such a situation occurs with those in- 
structions having multiple occurrences of a particular 
operand where it is essential that each operand must 
be the same otherwise a processing error is likely to oc- 
cur. Since the risks of error is likely to vary from platform 
to platform (i.e., executing a 3 address instruction on a 
2 address platform versus executing a 2 address in- 
struction on a 3 address platform), relying on the user 
to comprehend this fact is risky. In addition, an instruc- 
tion predicate can require the use of implementation de- 
tails of the compiler unit that may not be available to the 
writer of the AD file. Therefore, implicit instruction pred- 
icates are a part of the instruction selection process in 
that those particular instructions requiring special con- 
sideration are provided with the appropriate instruction 
predicate by the ADLC. 

[0032] When explicit and/or implicit instruction predi- 
cates are required, the ADLC provides the instruction 
selector source code at 406. Substantially simultane- 
ously with 406, the ADLC provides platform specific 
compiler source code at 408 based upon the AD file in- 
puts, whereas, platform independent compiler source 
code is provided at 410. The host compiler then builds 
the platform specific compiler having instruction selec- 
tion and execution cost analysis capability by compiling 
the platform independent source code, the platform de- 
pendent compiler source code, and the instruction pred- 
icates source code concurrently at 412. 
[0033] Once the platform specific compiler has been 
built at 414, the platform specific compiler is available 
during run-time for selecting those cost effective instruc- 
tions that are compiled into platform specific object code 
as needed. In a particular embodiment, during run-time, 
the platform independent interface mediates the flow of 
information between the compilation engine and the 
platform dependent compiler source code and instruc- 
tion predicate source code. 

[0034] More recently, the Java programming lan- 
guage, an object-oriented language, has introduced the 
possibility of compiling output (called bytecode) that can 
run on any computer system platform for which a Java 
virtual machine (or bytecode interpreter) is provided. 
The Java virtual machine is designed to convert the 
bytecode into instructions that can be executed by the 
actual hardware processor. Using this virtual machine, 
rather than being interpreted one instruction at a time, 
bytecode can be recompiled at each particular system 
platform by, in some cases, a just-in-time (JIT) compiler. 
[0035] Fig. 5 illustrates an apparatus that includes a 
Java Virtual Machine (JVM) 500 incorporating the com- 
piler unit 208 in accordance with an embodiment of the 
invention. In the described arrangement, a platform spe- 
cific AD file stack 502 coupled to the ADLC 202 includes 



a group of AD files each representing particular platform 
dependent compiler features. In the described embodi- 
ment, the AD file stack 502 includes the AD file 104 and 
AD file 1 06 representing the X86 and SPARC architec- 

5 tures, respectively. In the case where a number of dif- 
ferent AD files are included in the AD file stack 502, a 
selector unit (not shown) is typically used to select a par- 
ticular AD file from the AD file stack 502 corresponding 
to the desired operating platform. When the appropriate 

10 AD file is selected, the ADLC 202 converts the ADL code 
included in the selected AD file into appropriate platform 
dependent compiler source code as discussed above. 
[0036] In the Java programming language and envi- 
ronment, a just-in-time (JIT) compiler is a program that 

15 turns Java bytecode into instructions that can be sent 
directly to the processor. After a Java program has been 
written, the Java source language statements are com- 
piled by the Java compiler into Java bytecode rather 
than into code that contains instructions that match a 

20 particular hardware platform's processor (for example, 
an Intel Pentium microprocessor or an IBM System/390 
processor). The Java bytecode is platform-independent 
code that can be sent to any platform and run on that 
platform. 

25 [0037] More particularly, when bytecodes are provid- 
ed to a JIT compiler provided by the compiler unit 208, 
the compilation of methods contained in bytecodes 504 
is delayed until the methods are about to be executed. 
When bytecodes 504 are provided to an interpreter 506, 
30 bytecodes 504 are read into interpreter 506 one byte- 
code at a time. Interpreter 506 then performs the oper- 
ation defined by each bytecode as each bytecode is 
read into interpreter 506. That is, interpreter 506 "inter- 
prets" bytecodes 504, as will be appreciated by those 
35 skilled in the art. In general, interpreter 506 processes 
bytecodes 504 and performs operations associated with 
bytecodes 504 substantially continuously. 
[0038] When a method is compiled, the compiler unit 
208 generates machine instructions as selected by an 
40 instruction selector 508. The compiler unit 208 then gen- 
erates machine instructions from the selected byte- 
codes 504, and the resulting machine language instruc- 
tions may be executed directly by the target platform op- 
erating system 510. In general, the machine-language 
is instructions are discarded when virtual machine 500 ter- 
minates. 

[0039] Referring now to Fig. 6 illustrating the organi- 
zation of an AD file. 600 in accordance with an embodi- 
ment of the invention. It should be noted that the organ- 
50 ization shown is one of the many possible organizations 
that AD file 1 04 can take. In the described embodiment, 
the AD file 600 is a hierarchically organized set of dis- 
tinct platform architecture descriptor data fields. By way 
of example, a register definition data field 602 is used 
55 by the ADLC to describe individual registers and classes 
of registers with the target architecture. An encoding 
block data field 604 specifies the encoding classes used 
by the target compiler to output byte streams. A frame 
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management block data field 606 Includes information 
that defines the frame structure and management pro- 
tocols. Such information defines, for example, what di- 
rection the frame stack grows, the number of stack slots 
consumed by a monitor enter operation, stack alignment 
requirements, number of stack slots reserved for "Top 
of Stack", amongst others. 

[0040] An operand data field 608 provides operand 
definitions that must precede instruction definitions for 
correct compilation in the ADLC since operands consti- 
tute user defined types which are used in instruction def- 
initions. A pipeline rules data field 610 is provided to de- 
fine the behavior of the target architecture pipeline. An 
instruction definitions data field 612 provides instruction 
formats for the target architecture as well as corre- 
sponding instruction predicates. A peephole data field 
614 provides target architecture specific optimization 
rules used by the ADLC. 

[0041] The hierarchical organization of the AD file 600 
underlies the interrelationship amongst the various AD 
input data fields. Fig. 6B graphically illustrates one such 
relationship, specifically, the relationship between the 
various operands included in the instruction definitions 
data field 612. By performing an backwards traversal 
from using the instruction definitions data field 612 as 
the root, the relational dependencies for the various op- 
erands required to be input to the AD input data field is 
determinable. For example, by performing an upward 
traversal starting from the instruction definition data field 
61 2 as the root and extending upward along the various 
branches, the pipeline rules data field 610 and operand 
definitions data field 608 are encountered. Performing 
an upward traversals from the from the pipeline rules 
data field 612 is the register definitions data field 602, 
while the register definitions data field 602 and the en- 
coding class data field 608 are encountered when an 
upwards traversal from the operand definitions data field 
608 is performed. In this way, the various operands re- 
quired to fully define a particular instruction definition is 
provided. 

[0042] Fig. 7 illustrates an exemplary interface 700 
used by the platform dependent compiler in accordance 
with an embodiment of the invention. The interface 700 
mediates communication between the compiler engine 
212 (or the platform independent object code) and the 
platform dependent object code in the block 206. In the 
embodiment shown, the ADLC output includes object 
code for a deterministic finite automaton (DFA) 702 that 
specifies the mapping from ideal operations to machine 
instructions. The ADLC output also includes object code 
that defines a set of instruction classes 704 that are used 
to, for example, define legal register masks, encoding 
methods, branch offset behavior, etc. Object code for a 
peephole rules orac!e706 that specifies machine specif- 
ic trees that are legal to optimize and what the correct 
replacement is for those trees is also output by the 
ADLC. In this way, the platform specific architecture 
characteristics are automatically provided in a format 



suitable for the compilation engine 212 to use in com- 
piling source code into platform dependent object code. 
[0043] The target independent portion of the interface 
700 is coupled to a matcher 708 which generates input 
5 trees to be processed by the DFA 702 that performs bot- 
tom up rewrite rule system (BURS) style tree pattern 
matching in order to select machine instructions for ideal 
operations in the intermediate representation. 
[0044] The interface 700 also includes object code ar- 
io range to act as a peephole DFA that performs tree pat- 
tern matching to find optimization candidates and re- 
places matched trees of machine instructions with opti- 
mized trees of machine instructions. A matcher 708 per- 
forms instruction selection using the matcher DFA and 
15 builds machine specific intermediate representations. A 
scheduler 710 orders the machine specific intermediate 
representations while a register allocator 712 selects a 
legal assignment of registers to operands in the ma- 
chine specific representation. This includes the insertion 
of any instructions necessary to relocate values to prop- 
er locations (such as moving arguments to their appro- 
priate location specified by the calling convention). A 
peephole optimizer 714 pattern matches small trees in 
the machine specific representation and replaces them 
with more optimal machine specific trees. An object 
code output 716 uses virtual calls to encode machine 
specific representation as machine object code in a buff- 
er and makes the buffer available to the virtual machine. 
[0045] Fig. 8 is an exemplary representation of an in- 
struction compilation process carried out during run- 
time by the compilation engine 212 in accordance with 
an embodiment of the invention. During the run-time, the 
compilation engine 212, when required, generates an 
information request (referred to, in a particular embodi- 
ment, as an emit function call) for specific information 
from the platform dependent compiler object code and, 
if necessary, the instruction selector object code. By way 
of example, when the compilation engine 212 requires 
platform specific information in order to select an in- 
struction using the embedded instruction selector, the 
compilation engine 212 executes a function call to a in- 
formation requestor 802. The information requestor 
802, in turn, is coupled to the interface 700 which directs 
the function call to a information retriever 804 coupled 
to the DFA having the required instruction predicates. In 
the described embodiment, the DFA then selects the in- 
struction based upon the corresponding instruction 
predicates. The information retriever 804, in turn, re- 
sponds to the function call by retrieving the requested 
information. It should be noted that the retrieved infor- 
mation is structured in such a manner as to be readily 
used by the compilation engine 212. 
[0046] Fig. 9 represents an exemplary machine inde- 
pendent graph 900 of an instruction having multiple re- 
currences of an operand in accordance with an embod- 
iment of the invention. The machine independent graph 
900 is mapped to those machine dependent instructions 
having the same semantics. For example, the graph 900 
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node 902 represents at the most basic level a subtrac- 
tion operation that would map to any number of machine 
dependent subtraction operation (subA, subB, subC) 
having the same semantics corresponding to subtrac- 
tion. However, as discussed previously, not all of the 
possible mappings are desirable since some may in fact 
result in processing errors. This can occur, for example, 
in those situations where the target platform is a 2 ad- 
dress processor (such as an X86) and the result of the 
subtraction operation is required to be stored into a lo- 
cation distinct from that of the inputs (as can be done by 
3 address instructions in the SPARC architecture). Typ- 
ically, a bottom up matching process is used to break 
down the binary tree representation of the machine in- 
struction into semantics that the target platform can un- 
derstand and execute. In this example, each node of the 
tree is mapped to a target dependent instruction as se- 
lected by the instruction predicate using the instruction 
selection code generated by the ADLC. 
[0047] The root instruction node 902 indicates that the 
instruction 900 is an integer subtraction operation hav- 
ing as inputs the results of node 904 and node 906. The 
result of the subtraction operation is stored in a register 
at 908. In this example, the instruction selector selects 
the most cost effective instruction or sequence of in- 
structions out of all matching instructions to be execut- 
ed. 

[0048] Fig. 10 is a flowchart detailing an instruction 
selection process 1000 in accordance with an embodi- 
ment of the invention. The process 1 000 begins at 1 002 
by locating a node in the machine independent binary 
tree representation of the instruction. At 1004, the in- 
struction selection code corresponding to the located 
node is obtained and executed. The next potential user 
defined instruction that can matches the located node 
is then checked at 1006. This matching is based upon 
matching comparing the semantics between the ma- 
chine independent representation of the node and the 
target platform representation of the node. At 1008, a 
determination is made whether or not the location of in- 
puts of the user defined instruction matches the location 
of inputs for the located node. If it is determined that 
there is match, then a determination is made at 1010 
whether or not the instruction predicate is satisfied. If 
the instruction does not match or the instruction predi- 
cate is not satisfied, then control is passed back to 1 006 
where the next potential user defined instruction that 
matches the node is selected. 

[0049] Returning back to 1 01 0, if the instruction pred- 
icate is satisfied, then an estimate of the execution cost 
is made for the user defined instruction at 1012. A de- 
termination is made at 1014 whether or not the cost es- 
timate is less than the previous cost estimate. If the cost 
estimate is less, then the cost estimate variable is up- 
dated at 1 01 6 and the best instruction is updated at 1 01 8 
after which a determination at 1020 is made whether or 
not there are more user defined matching instructions. 
If there are no matching instructions then the process 



stops, however, if there are more matching instructions, 
then control is passed back to 1 006. 
[0050] Returning to 1014, if it is determined that the 
cost estimate is not less, then if there are additional 
5 matching instructions, control is passed back to 1006, 
otherwise the process stops. 

[0051 ] Fig. 1 1 illustrates a computer system 1 1 00 em- 
ployed to implement the invention. The computer sys- 
tem 1100 or, more specifically, CPUs 1102, may be ar- 

10 ranged to support a virtual machine, as will be appreci- 
ated by those skilled in the art. As is well known in the 
art, ROM acts to transfer data and instructions unidirec- 
tionally to the CPUs 1102, while RAM is used typically 
to transfer data and instructions in a bi-directional man- 

15 ner. CPUs 1102 may generally include any number of 
processors. Both primary storage devices 1104, 1106 
may include any suitable computer-readable media. A 
secondary storage medium 1108, which is typically a 
mass memory device, is also coupled bi-directionally to 

20 CPUs 1 1 02 and provides additional data storage capac- 
ity. The mass memory device 1108 is a computer- read- 
able medium that may be used to store programs includ- 
ing computer code, data, and the like. Typically, mass 
memory device 1108 is a storage medium such as a 

25 hard disk or a tape which generally slower than primary 
storage devices 1104, 1106. Mass memory storage de- 
vice 1 1 08 may take the form of a magnetic or paper tape 
reader or some other well-known device. It will be ap- 
preciated that the information retained within the mass 

30 memory device 1108, may, in appropriate cases, be in- 
corporated in standard fashion as part of RAM 1106 as 
virtual memory. A specific primary storage device 1104 
such as a CD-ROM may also pass data unidirectionally 
to the CPUs 1102. 

35 [0052] CPUs 1102 are also coupled to one or more 
input/output devices 1110 that may include, but are not 
limited to, devices such as video monitors, track balls, 
mice, keyboards, microphones, touch-sensitive dis- 
plays, transducer card readers, magnetic or paper tape 

40 readers, tablets, styluses, voice or handwriting recog- 
nizers, or other well-known input devices such as, of 
course, other computers. Finally, CPUs 1102 optionally 
may be coupled to a computer or telecommunications 
network, e.g., an Internet network or an intranet net- 

45 work, using a network connection as shown generally 
at 1112. With such a network connection, it is contem- 
plated that the CPUs 1102 might receive information 
from the network, or might output information to the net- 
work in the course of performing the above-described 

50 method steps. Such information, which is often repre- 
sented as a sequence of instructions to be executed us- 
ing CPUs 1102, may be received from and outputted to 
the network, for example, in the form of a computer data 
signal embodied in a carrier wave. The above-described 

55 devices and materials will be familiar to those of skill in 
the computer hardware and software arts. 
[0053] Although only a few embodiments of the 
present invention have been described, it should be un- 
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derstood that the present invention may be embodied 
in many other specific forms without departing from the 
spirit or the scope of the present invention. By way of 
example, the multi-platform compiler can be used in any 
computing system. 

[0054] Although the methods of porting a compiler 
from one operating system to another, different operat- 
ing system in accordance with the present invention are 
particularly suitable for implementation with respect to 
a Java™ based environment, the methods may gener- 
ally be applied in any suitable object-based environ- 
ment. In particular, the methods are suitable for use in 
platform-independent object-based environments. It 
should be appreciated that the methods may also be im- 
plemented in some distributed object-oriented systems. 
[0055] While the present invention has been de- 
scribed as being used with a distributed object based 
computer system, it should be appreciated that the 
present invention may generally be implemented on any 
suitable computing system having a compiler. There- 
fore, the present examples are to be considered as il- 
lustrative and not restrictive, and the invention is not to 
be limited to the details given herein, but may be modi- 
fied within the scope of the appended claims along with 
their full scope of equivalents. 



Claims 

1. An apparatus for generating a platform specific 
compiler having an embedded instruction selector, 
comprising: 

a set of user defined platform dependent com- 
piler architecture descriptors that describe cor- 
responding architectural features of a particular 
hardware platform dependent compiler; 
a set of instruction predicates used to identify 
platform specific instructions selected by the in- 
struction selector; 

an architecture descriptor compiler arranged to 
convert the user defined platform dependent 
compiler architecture descriptors into the plat- 
form dependent compiler source code and ar- 
ranged to convert the set of instruction predi- 
cates into platform specific instructor selector 
source code; 

a host compiler arranged to compile the plat- 
form dependent compiler source code into plat- 
form dependent compiler object code and ar- 
ranged to compile the platform specific instruc- 
tor selector source code into the embedded in- 
struction selector object code; 
platform independent compiler object code; 
and 

an interface arranged to mediate the flow of in- 
formation between the platform dependent 
compiler object code and the platform inde- 



pendent compiler object code during run time 
for the platform specific compiler, wherein dur- 
ing platform specific compiler run time, the em- 
bedded instruction selector selects the instruc- 

5 tion to be compiled based upon the execution 

cost of the selected instruction, and wherein the 
embedded instruction selector provides implicit 
instruction predicates used by the platform spe- 
cific compiler to compile the selected instruc- 

10 tion. 

2. An apparatus as recited in claim 1 , wherein the plat- 
form specific compiler includes platform independ- 
ent compiler object code and platform dependent 

15 compiler object code suitable for execution of the 
particular hardware platform. 

3. An apparatus as recited in claim 2, wherein during 
platform specific compiler run time, the platform in- 

20 dependent compiler object code requests specific 
platform dependent object code information by pro- 
viding an information request to the interface which 
directs the information request to a pre-determined 
platform dependent compiler object code informa- 

25 tion retriever. 

4. An apparatus as recited in claim 3, wherein the plat- 
form dependent compiler object code information 
retriever responds to the information request by re- 

30 trieving specific platform dependent compiler object 
code information in satisfaction of the information 
request. 

5. An apparatus as recited in claim 4, wherein the re- 
35 trieved information is provided to the interface 

which, in turn, directs the information to the infor- 
mation requestor. 

6. An apparatus as recited in claim 1 , wherein said ap- 
40 paratus comprises a plurality of sets of user defined 

platform dependent architecture descriptors, 
wherein each of which corresponds to a different 
hardware platform. 

45 7. A method of building a platform specific compiler 
having an embedded instruction selector, compris- 
ing: 

providing a set of user defined platform de- 
50 pendent compiler architecture descriptors that 

describe corresponding architectural features 
of a particular hardware platform dependent 
compiler; 

providing a set of user defined instruction pred- 
55 icates used by the embedded instruction selec- 

tor to select those instructions to be compiled 
by the platform specific compiler during run 
time; 
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converting the set of user defined platform de- 
pendent compiler architecture descriptors into 
platform dependent compiler source code by 
an architecture descriptor compiler; 
converting the set of user defined instruction 5 
predicates into platform instruction selector 
source code by an architecture descriptor com- 
piler; 

compiling the platform dependent compiler 
source code into platform dependent object io 
code by a host compiler coupled to the archi- 
tecture descriptor compiler; 
compiling the instruction selector source code 
into instruction selector object code by a host 
compiler coupled to the architecture descriptor 15 
compiler; 

providing platform independent compiler object 
code, wherein the platform independent com- 
piler object code and the platform dependent 
compiler object code are suitable for execution 20 
on the particular hardware platform; and 
forming the embedded instruction selector from 
the platform dependent compiler object code, 
the instruction selector object code, and the 
platform independent compiler object code. 25 

8. A method as recited in claim 7, further comprising: 

requesting specific platform dependent object 
code information by the platform independent 30 
compiler object code by the platform independ- 
ent object code during platform specific compil- 
er run time; 

providing an information request to the inter- 
face; and 35 
directing the information request to a pre-deter- 
mined platform dependent compiler object 
code information retriever by the interface. 

9. A method as recited in claim 8, further comprising: 40 

retrieving specific platform dependent compil- 
er object code information in satisfaction of the in- 
formation request in response to the request by the 
information retriever. 

45 

10. A method as recited in claim 9, further comprising: 

directing the retrieved information to the infor- 
mation requestor by the interface. 



wherein the platform independent compiler ob- 
ject code and the platform dependent compiler 
object code are suitable for execution on a par- 
ticular hardware platform; 
a platform dependent instruction selector ob- 
ject code embedded in the platform dependent 
compiler object code; 

an interface partially embedded in the platform 
independent code and partially embedded in 
the platform dependent object code, wherein 
during platform specific compiler run time, the 
interface mediates flow of information between 
the platform independent compiler code and 
the platform dependent compiler code. 

, 13. A compiler as recited in claim 12, wherein during 
platform specific compiler run time, the platform in- 
dependent compiler object code requests specific 
platform dependent object code information by pro- 
viding an information request to the interface which 
directs the information request to a pre-determined 
platform dependent compiler object code informa- 
tion retriever. 

14. An apparatus as recited in claim 13, wherein the 
platform dependent compiler object code informa- 
tion retriever responds to the information request by 
retrieving specific platform dependent compiler ob- 
ject code information in satisfaction of the informa- 
tion request. 

15. An apparatus as recited in claim 14, wherein the re- 
trieved information is provided to the interface 
which, in turn, directs the information to the infor- 
mation requestor. 

16. An apparatus as recited in claim 12, wherein said 
apparatus comprises a plurality of sets of user de- 
fined platform dependent architecture descriptors, 
wherein each of which corresponds to a different 
hardware platform. 



11 . A method as recited in claim 10, further comprising: so 

directing the retrieved information to the infor- 
mation requestor by the interface. 

12. A platform specific compiler having an embedded 
instruction selector, comprising: ss 

a platform dependent compiler object code; 
a platform independent compiler object code, 
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